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ABSTRACT _ , , 

The stability and change of verbal, non-verbal, and 

total IQ scores from group tests were investigated for students ^ 
tested at Grades 1, 2, 4, 7, 9, and 11, Conclusions are discussed in 
detail, and the following recommendations are made: (1) 

intelligence tests should not be routinely administered in Grades 1 
and 2 unless the users and interpreters of these scores have received 
extensive training in their meaning and limitations; (2) Non-language 
and non-verbal IQ scores must be interpreted even more cautiously 
than verbal IQ scores; (3) Teachers and other users of IQ scores from 
group tests need to be made aware of the fact that even when there is 
considerable overall stability, scores of a few individuals will vary 
greatly; and (4) No meaning should be attached to differences in 
verbal and non-verbal IQ scores prior to Grade 7- (AG) 






I 





I 

••v'. 






fA 

r\j 




in 



wU 









FINAL REPORT 
Project No. O-H-024 
Grant . No . OEG-8-70-000 7(509) 



?/9 ^(/ 

■\W 



< - 
1: 




THE STABILITY AND QIANGE 
AND NON-LANGUAGE IQ 



OF LANGUAGE 
SCORES 



Kenneth D. Hopkins 
University of Colorado 
Boulder r, Colorado 80302 



December 1971 



U.S. DEPARTMENT OF 
HEALTH, EDUCATION, AND WELFARE 

Office of Education 
Bureau of Research 



U.8. DEPARTMENT OF HEALTH, 
EDUCATION ft WELFARE 
OFFICE OF EDUCATION 
THIS DOCUMENT HAS BEEN REPRO- 
DUCED EXACTLY AS RECEIVED FROM 
THE PERSON OR ORGANIZATION ORIG- 
INATING IT. POINTS OF VIEW OR OPIN- 
IONS STATED DO NOT NECESSARILY 
REPRESENT OFFICIAL OFFICE OF EDU- 
CATION POSITION OR POLICY, 



1 



EDO 58323 



Final Report 



Project No. O-H-024 
Grant No. OEG-8- 70-00 0 7 (509) 



THE STABILITY AND aiANGE OF LANGUAGE 
AND NON-LANGUAGE IQ SCORES 

Kenneth D. Hopkins 
University of Colorado 
Boulder, Colorado 



December 1971 



The research report herein was performed pursuant to a grant with 
the Office of Education, U.S. Department of Health, Education, and 
Welfare. Contractors undertaking such projects under Government 
sponsorship are encouraged to express freely their professional 
judgment in the conduct of the project. Points of view or opinions 
stated do not, therefore, necessarily represent official Office 
of Education position or policy. 



U.S. DEPARTMENT OF 
HEALTH, EDUCATION, AND WELFARE 



Office of Education 
Bureau of Research 




11 

2 



Table of Contents 



Summary 

Background for the Study 



1 




2 



Belated Literature 3 

Studies of IQ Stability from Guup Tests with 

School-Age Children 3 

Summary of the Literature Review 6 

Methods 7 

Findings H 

Stability of Verbal IQs 11 

Stability of Non-Verbal IQa 12 

The Stability of Total IQ Scores 14 

Graphic Representation of Sbabillty Coefficients.. 17 

General Conclusions Regarding IQ Stability 17 

Changes in IQ Scores 17 

Comparative IQ Stability for Boys and Girls...* 19 

Reliability of Verbal vs* Non-verbal IQ Differences*..** 25 

Associated Changes in Verbal and Non-verbal IQs >.**25 

Predictive Value of Multiple IQ Scores.*.*** •***•* 27 

The Multiple Regression Approach.**** 27 

The Use of the Average of Prior Scores as Predictors ..*28 

Factorial Composition of Verbal vs. Non-verbal IQ 28 

Conclusions and Recommendations 30 

Conclusions * 30 

Recommendations 32 



References 



33 



■ 




ERIC 



ill 

3 



Tables and Figures 



Table 1 Tear and Month of Testing , Mean Age In Months* 
and Test Administered at Each grade Level in 
Claaaea of '69 and '72 8 

Table 2 Means and Standard Deviations for the Verbal 

Non-verbal* and Total IQ Scorea for the Sample 
at Various Grade Levels 9 

Table 3 Means and Standard Deviations of Verbal* Non- 
Verbal* and Total IQ Scores for Students with 
a Complete Set of Scores across Grades 10 

Table 4 Stability and Generali sab ill ty Coefficients of 

Verbal IQ Scores for the Total Sample* 11 

Table 5 Stability and Generalisability Coefficients of 

Verbal IQs Corrected for Terminal Variability 12 

Table 6 Stability and Geaaralisability Coefficients for 
Verbal IQ Scores Corrected for Terminal 
Variability md Attenuation 13 

Table 7 Stability Coefficient of Non-Verbal IQ Scores 

for Total Sample 13 

Table 8 Stability and Generalisability Coefficients for 
Non-Verbal IQ Scores Corrected for Terminal 
VariabiUty 15 

Table 9 Stability and Generalisability Coefficients for Non- 
Verbal IQ Scores Corrected for Terminal 
Variability and Attenuation 15 

Table 10 Stability and Generalisability for Total IQ Scores 16 

Table 11 Stability and Generalisability Coefficients for 

Total IQs Corrected to a Coimnon Variid)ility 16 

Table 12 Stability and Generalisability Coefficients for 
Total IQ Scores Corrected for Terminal 

Variability and Attenuation 16 

Table 13 Means and Standard Deviations Algebraic and 
Absolute Differences Between Verbal IQs for 
Each Pair of Grade Levels 20 

Table l4 Means and Standard Deviations of Algebraic and 
Absolute Differences Between Non-Verbal IQs for 
Each Pair of Grads Levels 21 




iv 

.4 



Table 15 Means and Standard Deviations of Algebraic and 
Absolute Differences Between Total IQs for 
Each Fair of Grade Levels '•••22 

Table 16 Means and Standard Deviations for Each Sex at 

Each Grade Level 23 

Table 117 Stability Coefficients for Verbal | Non-Verbal 

and Total IQ Scores by Sex 24 

Table 18 Stability Coefficients for Verbal - Non-verbal 
IQ Differences for Students with a Complete set 
of Scores In Grades 2-11 26 

Table 19 Correlation Coefficients Between Changes In Verbal 
IQ Scores Across Time with Associated Changes In 
Non-Verbal IQ Scores 26 

Table 20 Correlation of the Average of Previous Tests Coispared 
to the Correlation of Only the Most Recent Teat In 
the Set 29 

Table 21 Correlations of Verbal and Non-verbal IQ Scores 

with References Axes 31 

Figure 1 Graphic Representation of the Stability Coefficients 
for Verbal, Non-verbal, and Total IQs at Certain 
Grade Levels with IQ at Grade 11 *18 



Acknowledgments 



The author Is indebted to several persons who made valuable 
contributions to this project. Glenn H. Bracht and Larry Nelson 
were responsible for most of the computer analyses. Beverly 
Anderson assisted in the review of related literature 2nd in the 
data collection. Nancy Gallagher edited and typed the final report. 
George Kretke and Socrates Rapagna assisted with various statistical 
computations. Gene V Glass made helpful suggestions regarding the 
factor analytic portions of the study. Eugene Gullette and the 
Boulder Valley Public Schools cooperated by providing access to 
testing information. All of these individuals deserve credit for 
their contribution to the project. 



SUMMARY 



The stability and change of Verbal » N6n-verbal» and Total IQ 
scores from group tests was investigated for students tested at 
grades 1» 2, 4, 9, and 11. The study provided empirical support 

for the following conclusions: 

1. Contrary to most textbook treatments of the topic and 
prevailing professional impression » there is little or 
no information on the stability of IQ scores from group 
tests especially (1) at tlie primary grades » or (2) for 

time intervals greater than three years, or (3) on currently 
used tests, or (4) on Non-verbal tests. 

2. Verbal IQs are much more stable than Non-verbal IQ scores, 
especially prior to junior high school. 

3. At grades one and two Verbal and Non-verbal IQs reflect 
relatively little lasting variance, even over short Intervals. 

4. By grade four. Verbal IQs show a very high relationship 
(about .8) with corresponding IQs thereafter. 

5. Non-verbal IQs at grade seveir reflect about the same degree of 
stability as Verbal IQa at grade four. 

6. Total IQs have a pattern of stability highly similar to that 
of Verbal IQ scores. 

7. IQ scores from group tests to be considerably less stable than 
Stanford-Binet IQs prior to grade seven, atter which there is 
little differences in stability. 

8. In the main, there is considerable change in true Verbal IQ 
scores prior to grade four, but little change thereafter. 

9. In the main, there is considerable change in true Non- 
verbal IQ scores in elementary school , but only minor 
changes thereafter. 

10. Differences in Verbal and Non-verbal IQs show meager 
stability over time prior to grade seven, after which there 
Is considerable stability. 

11. The use of IQ scores in addition to the most recent score 
offers little promise for improving the determination of 
eventual Intellectual status. Multiple regression can 
contribute slight Improvement in prediction, whereas the 
averaging of prior performances lessened the accuracy in 
prediction . 

12. The factor analytic study confirmed Informal observations made 
from a study of the general correlation matrix -- Verbal 

IQ vari an ceibe comes stable at about grade four. Non-verbal 
IQ variance stabilizes at about grade seven. 
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THE STABILITY AND CHANGE OF LANGUAGE 
AND NON-LANGUAGE IQ SCORES 



Background for the Study , 

Ever since the concept of an intelligence quotient, or IQ, was 
first applied to the measurement of Intelligence by Terman in 1916 
(Horrocks, 1964), the "constancy" of the IQ has been an Important 
concern of educators and psychologists. 

The question all too frequently Is discussed as a general 
topic, as if the findings on a given test generalized to all tests 
and all groups. For example, Bloom (1963, p, 383) has stated, 

.,, General intelligence as measured by either 
an Individual intelligence test or a group test 
becomes relatively constant or stable by 
about the second or third year of school, ,, after 
age eight the correlations between measurements 
over periods of six to ten years rarely 
are less than ,80 when the same or parallel forms 
of the same test are used In repeated 
measurements , 

Anastasl (1960, p, 232) also ccimnents , "Empirically, IQ's have been 
found to remain sufficiently constant during the elementary schools 
years to make predictions over several years feasible," Most 
studies on the topic, however, have been made using individual tests, 
usually the Stanford-Blnet ; relatively few have used group tests, 
except for college and adult subjects. No previous study employing 
group tests, the only type ever taken by nlnty percent or more of 
the population, has spanned the ten year Interval mentioned in the 
Bloom quote above; in fact, only one previous study had encompassed 
a time Interval of even six years and that was done years ago on a 
small, atypical sanq>le using an obsolete test. 

The literature search failed to reveal any study in which the IQ 
constancy on group language and/or non- language tests was Investigated 
for either normal or special reference groups that continued from 
elementary through high school. Since Inferences with long-range 
educational and vocational Implications arc Important functions of 
intelligence tests, it is apparent that educational and psychological 
personnel must be aware of the degree of accuracy of such Inferences 
made from these tests. 
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Non-language intelligence tests have been devised ^ in part^ to 
provide an intellectual assessment of pupils with atypical cultural 
and/or lingual backgrounds. It Is usually assumed that non-language 
assessment is more accurate for these types of pupils. The degree 
of accuracy of future predictions from these Instruments must be 
known before their value can be ascertained. 

The extent to which IQ scores reflect Innate potential or 
abilities developed through various environmental stimuli is not 
addressed In, nor relevant to^ the present study. Almost all pupils 
take several Intelligence tests during the K-12 school years » and 
Inferences are made from these scpres. The critical question addressed 
oy the present study Is how much do obtained IQ scores change over 
time? The accompanying long-term Inferences can be no more accurate 
than the degree of stability reflected in the IQ scores. 

Related Literature . 

The literature on IQ constancy was reviewed by Thorndike (1933, 
1940) and more recently by Bloom (1964) who presented a summary of 
many previous studies. These reviews have been devoted almost 
exclusively to research involving the Stanford-Blnet or adult 
subjects. 

The Stanford-Binet , a verbal intelligence test that requires 
individual administration by a highly trained psychologist or 
psychometrlst , and hence Is very expensive, is the only test on which 
the stability of the IQ has been extensively investigated. It is 
paradoxical that long-term stability is usually treated as a 
general topic, apart from any particular test, as If Stanford-Binet 
data generalized to all tests and groups. 

An exhaxistive literature search identified twenty-one research 
studies that contained at least fragmentary data on the stability of 
mental ability performance for some portion' of time during the 
developmental (pre-adult) period. Each of these studies is briefly 
described In chronological order. 

Studies of IQ Stability from Group 
Tests with School-Age Children . 

Garrison and Robinson (1925) fpund a very high correlation 
(.91 and .93) with 131 children in grades 3-6 in the Peabody 
Demonstration School between scores on the National Intelligence 
Tests and retests two (or three) years later. Descriptive data 
regarding variability was absent, although an average IQ of 133 was 
reported. It is unclear whether raw scores or IQ score were the 
basis for the correlation — a critical factor since the age-range of 
the sample was considerable. 

Hirsh (1930) (also Miller, 1933, using the same data) correlated 
grade- one Otis IQ scores obtained by 160 students with corresponding 
scores 1-5 years later. The correlation coefficients between grade 
one scores and grade 2-6 performance reflected an atypical pattern: 



.79, .77, .76, .82, .88, respectively. It is unclear why the standard 
deviation of grade six scores was so large • about 20 points. 

Lincoln and Wadlelgh (1932) studied 154 "regular pupils" tested 
In grades 3*^ with Otis Primary and retested 1-3 years later using 
either the National or Terman Intelligence tests. No descriptive 
data on the sample such as means or variability were given, nor was 
a correlation coefficient reported — only that the average difference 
between scores was about 7 IQ points. 

Lauderbach and Hahse (19 32) reported a correlation of .79 for 
a heterogeneous group of 150 pupils In grades 4-6 who were tested one 
year later on the now-obsolete McCall Multi-mental Scale. 

Traxler (1934) obtained correlations of .76, .67, and .68 for 
a non-representative group of ninth grade students at the University 
of Chicago High School retest one, two^ and three years later on the 
Otis Self-Administering Test. 

Seagoe (1934) studied pupils who were tested in grade one (Detroit 
Firs t-Grade)| three (Detroit Primary) , five (National Intelligence) and 
seven (Terman Group). Unexpectedly, the correlations between scores 
In grades one and seven (.77) was higher than for scores In grades 
one and three (.64). Other correlations ranged from .70 (3 with 
5) to .87 (5 with 7). 

Pinter and Stanton (1937) studied 59 students who took 
Thorndike's CAVD tests in grades three, four, and five. No data 
on means or variability was given, hence the represoitatlveness 
of the sample Is uncertain, but the stability coefficient reported 
were r * .68 (3 with 4) , r = .76 (3 with 5), and .82 (4 with 5). 

Goldin and Bothchlld (1942) investigated IQ stability on 174 
pupils tested In grades one, and four on the Pltner-Cunningham and 
the Henmon-Nelson. Tlie correlation was only .46. Additional 
Henmon-Nelson test scores were available on 54-60 of the same students 
in grades six and eight. The correlation coefficients between IQ 
scores in grades four and six and six and eight were In the .81-, 85 
range. Other correlations were not reported, 

Knezevlch (1946) found a correlation of ,70 between Henmon-Nelson 
IQ scores on 113 sophomores In a rural high school with corresponding 
scores two years later. The group was atyplcally homogeneous (standard 
deviation was only 8,1 points), hence the stability coefficient Is a 
conservative estimate. 

Doppelt and Bennett (1951) studies the grade 9-12 stability of the 
various tests of the Differential Aptitude Tests for 140 boys and 
girls. Stability coefficients ranged .59-. 78 for all tests except 
Verbal Reasoning, which had higher coefficient In the .84-. 86 range. 

No descriptive data were given for the sanqple. 
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Layton (1954) studied 2,169 students who were tested in grades 
9 and 12 with the American Council on Education ( ACE) test, and 
reported a correlation of, 80. This test is usually considered 
primarily an adiievement , rather than an aptitude test, and bears 
only indirectly on IQ stability. 

Collman and Newlyn (1958) studied 182 English pupils, ages 
6-15, in "schools for the subnormal" who took the Terman-Merrill 
Intelligence Test twice. The testing interval was one year, A 
correlation of ,93 was reported, unfortunately it was not made 
clear whether IQ scores or Mental Ages were correlated — a serious 
defect in view of the age range in the sample. An average IQ of 
68 was reported for the group, with a 42-89 range. 

Tyler (1958) studied 83 pupils who were tested with Primary 
Mental Abilities Tests (PMA) in grades one and four. Low stability 
coefficients were' .48, .42, .51, .53 for the Verbal, Spatial, Numerical 
and Perceptual Tests respectively, but a correlational ,70 
was reported for the total score. She also studied 127 pupils 
tested with the PMA in grades 4 and 8. Correlations of ,65, ,82, 

,41, .62, and ,66 were reported for the Verbal, Numerical, Spatial, 
Reasoning, and Total scores respectively. Note the low stfdjUity 
of the Spatial test. No descriptive data on the meanson variance 
were reported. 

Meyer and Bendig (1961) studied 100 eighth grade students who 
took the Primary Mental Abilities Tests (PMA) in grades 8 and 12 
Stability coefficients of ,81, .75, .73, .66 and .43 were reportevl 
for the Verbal, Reasoning, Numerical, Spatial and Word Fluency Tests 
respectively. The total scores had a stability coefficient 
of .82, Note the very low stability of the Word Fluency test. 

Notice also that these values are higher than those found by Tyler 
with the same time interval but at a younger age. 

Hopkins and Bibelheimer (1963) studied 404 students with 
representative means and variability who took the California Tes^ o f 
Mental Maturity (CTMM) (S-Form) in grades 3, 5, and 7. A parallel 
study of 314 pxjplls who took the CTMM was reported subsequently by 
Finley, Thompson, and Cognata (1966), Both studies found low 
stability coefficients (.50 or below) between Language and Non-language 
IQ scores at grades three with corresponding scores in grades five 
and seven,’ The grade-five-to-seven Language IQ stability was 
higher (r's of ,71-, 78) than Non-language IQs (r's of .47-. 52). 

In a subsequent study Hopkins and Bibelheimer (1971) obtained 
CTMM IQ data on 354 pupils from the same sample and found that grade 
eight IQs had about the same stability coefficients as the grade 
seven reported above, and that Language IQs tended to be more stidile 
over time than Non-language IQs. 



Bradshaw (1964) investigated IQ stability on 58 public school 
children tested in grades 2 and 4 with the California Test of 
Mental Maturity (CTMM) and found coefficients of .65, ,50, and .66 
for Language^ Non-language , and Total IQs respectively. No 
data on group variability were reported. 

TuUey and Hall (1965) investigated the one-year stability of 
the School and College Ability Test over the grade 9-10 period. 

Very high stability coefficients of .92, .88, and ,94 were reported 
for the Verbal, Quantitative, and Total Scores respectively. This 
test does not yield IQ scores and is usually considered to be more 
heavily loaded with academic achievement content than are most 
"Intelligence" tests and m^ have greater stability accordingly 
although little Is known about achievement stability. 

Eagle (1966) studied 115 boys and 150 girls who were tested 
with the Lorge -Thom dike Intelligence Test in grades 3 or 4 and 
retested in grades 7 or 8, He reported correlations of ,78 and 
,67 for Verbal and Non-verbal IQs for girls, and corresponding 
lower values of ,59 and ,49 for boys. Unfortunately, no data were 
reported on group variability. 

Lorge, Thorndike, and Hager (1966) reported several stability 
coefficients for certain groups of pupils for one to tliree-year 
intervals on the Lorge-Thomdlke Intelligence Test . The stability 
coefficients were larger for the higher grades. For example, grades 
3-6 correlations were about .5 for both Verbal and Non-verbal IQs, 
but the corresponding values were ,82 and ,74 for the grade 6-9 corre- 
lations . 

Summary of the Literature Review . 

Several general conclusions can be drawn from the literature 
survey. Seven of the 21 studies were conducted and reported prior 
to 1938 and pertain , for the most part, to obsolete tests. 

Five other utilized tests, although cognitive in nature, are not 
Intelligence tests In the conventional sense. Six studies employed 
currently used intelligence tests, but the largest tine interval 
Investigated was five years. 

Several of the studies failed to report data on the sanq>le's mean 
and variability. The latter being especially critical since It has 
a substantial affect onthe magnitude of the stability coefficient — 
other things being equal, the more heterogeneous the sample, the 
larger the stability coefficient. Even more serious is the fact that 
in some of the studies, it is not clear whether IQ scores, raw score 
or Mental Ages were correlated. The latter will have a much larger 
stability coefficient when the sample is not homogeneous in age. 

This factor probably accounts for the abnormally high coefficients 
reported by Garrison and Robinson (1925), Hirsh (1936), Collmann 
and Newlyn (1958). 
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Another confounding factor In analyzing previous research results 
from the fact that all IQ stability studies on group tests prior to 
1966 Investigated the stability of ratio IQ's. Almost all currently 
used tests use standard score deviation IQs. With ratio IQs one 
rarely can determine whether a sample's variability is representative 
or not. For example, prior to 1966 when the Stanford-Blnet used 
ratio IQ's, the standard deviation of the IQ scores was 12 points at 
age 6 but 20 points at age 12 ( on Form L) . This type of normative 
data Is not available On the group tests, hence one cannot make 
adjustments for atypical variability. For example, KnezevLch (1946) 
reported a standard deviation of only 8.1 points for the twelve-grade 
Henmon-Nelson IQ scores, In his sample, yet Hirsh's (1930) sample 
had standard deviation of 19.8 points on Otis IQ scores at grade 
six. It Is unfortunate that only six or so studies were adequately 
conducted and reported to be of value as background data against 
whlcli those from the present study can be related. 

METHODS 

The entire student population for two graduating classes 
('69 and '72) from a school district of approximately 30 schools 
and 20,000 enrollment In Boxider County, Colorado were enqxloyed In 
this study. This district's testing program has prescribed groip 
Intelligence tests be administered to all pupils In grades 1, 2, 4, 7, 
9, and 11 for more than a decade. All testing was conducted In 
Septenber or October — the mean age of the initial (grade one) 
testing was 78.2 months. 

Initially the results for the two gradiiatlng classes ('69 and 
'72) were analyzed separately and gave parallel findings, but both 
groups were subsequently combined to give greater stability and 
generallzablllty to the findings. 

Table 1 gives specific descriptive data for the two classes 
and the tests employed. Scores for any students who was present 
for at least two of the testing periods were retained In the analysis* 
The means and standard deviations of the Verbal (V), Non-verbal (NV) , 
and Total (T) IQ scores of all pupils retained In the sample are 
given In Table 2. Although It Is evident the sanple Is above the 
national average In central tendency, this difference Wwuid not be 
expected to seriously affect the stability coefficients since the 
sample's variability is comparable (slightly less) than the national 
parameters. 

The standard deviations in Table 2 are for all students In the 
sample at each grade level. The means and standard deviation were 
also computed separately for the 186 students present at each of the 
six testings. Table 3 gives these data. Notice that In roost cases 
the corresponding standard deviations dlfferby less than 1 IQ point, 
hence the resulting stability coefficients would not be expected 
to be seriously affected by differential variability resulting from 
student attrition and mobility. 
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Table 1 



Year and Month of Testing, Mean Age In Konths, and Test 
Administered at Each Grade Level In Classes of '69 and *72 



Grade 


Class 


Testing 
(Mo. /Year) 


• Mean 
Age (Mo.) 


Test Used 


1^ 


'69 


10/57 


78 


CTMM^, 1950 S-Form, Primary Level 


2 


'69 


10/58 


92 


CTMM, 1957 S-Form, Primary Level 




'72 


9/60 


91 


CTMM, 1957 S-Form, Primary Level 


4 


'69 


10/60 


116 


CTTM, 1957 S-Form, Elementary Level 




'72 


9/62 


114 


L-T^, Form A, Level 3 


7 


'69 


9/63 


150 


L-T, Form A, Level 4 




'72 


10/65 


151 


L-T, Form A, Level 4 


9 


'69 


9/65 


174 


L-T, Form A, Level 5 




'72 


10/67 


175 


L-Tj Form A, Level 5 


11 


'69 


10/67 


199 


L-T, Form B, Level 5 




'72 


10/69 


199 


L-T, Form B, Level 5 



^California Test of Mental Maturity. 

^First-grade IQ scores were not available for Class of *72. 
^Lorge-Thomdike Intelligence Test, 
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Table 2 



Means and Standard Deviations for the Verbal, Non-Verbal and 
Total IQ Scores for the Sample at Various Grade Levels 



Grade 


Test 


N 


Verbal IQ 


Non-Verbal IQ 


Total IQ 


1 


CTMM® 


AA6 X 


114.24 


105.68 


109 . 87. 






s 


12.56 


12.09 


10.45^ 


2 


CTMM 


995 X 


113.77 


110.95 


112.40 






s 


17.98 


17.83 


15.20 


A 


CTMM 


1629 X 


111.88 


111.71 


111.84 




and L— 1 


s 


16.08 


16.87 


14.74 


7 


L-T 


22 8A X 


112.64 


113.42 


113.03 






s 


14.14 


14.88 


13.28 


9 


L-T 


2166 X 


114.34 


118.99 


116.66 






s 


13.43 


14.49 


12.73 


11 


L-T 


1859 X 


113.71 


119.91 


116.82 






s 


13.50 


14.20 


12.47 



^Slnce the CTMM used a ratio IQ, the value of the standard deviation 
is not fixed or constant across grade levels. 

^The variance of the average of two variables, each with equal 
variances, will always be less than the variance of either variable 
except when p - 1.0, in which case they will be equal (see Hopkins 
and Blbelheimer, 1971 for proof). 



Table 3 



Means and Standard Deviations of Verbal, Non-Verbal, and 
Total IQ Scores for Students vrlth a Complete Set of 
Scores across Grades 1-11 (N « 186) 



Grade 


Test 


Verbal IQ 


Non-Verbal IQ 


Total IQ 


1 


CTMM X 


115.61 


106.58 


111.03 




s 


11.14 


11.87 


9.44 


2 


CTMM X 


117.45 


116.04 


116.88 




s 


15.76 


17.32 


13.20 


4 


CTMM X 


120.92 


120.98 


121.06 




s 


16.27 


17.74 


14.41 


7 


L-T X 


118.32 


117.48 


117.95 




s 


12.77 


14.25 


12.20 


9 


L-T X 


118.97 


123.74 


121.37 




s 


12.59 


11.83 


10.74 


11 


L-T X 


117.72 


122.98 


120.30 




s 


13.86 


13.70 


12.31 



16 
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FINDINGS 



Stability of Verbal IQs 

The stability and generallzablllty (over time and tests) coefficients 
for Verbal IQs on the total sample are given In Table A; the 
corresponding N’s on vhlch ther’s are based are given below the 
diagonal. For example, the correlation of .51 between Verbal IQs 
In grades one and two was based on 334 pupll^ . Notice that 
the stability coefficients are quite low at grades one and two, even 
for a one or two-year Interval. Conversely, the stability of the 
grade four verbal IQ scores are quite hl^ for all subsequent 
tests thereafter. Since the sizes of the samples on which the 
correlation coefficients are based are large, the resulting 
standard errors of the r's are small, ranging from less than .01 
(for N ■ 1813) to .05 (for N ■ 236). 

The stability coefficients In xable 4 are reduced sll^tly In 
magnitude by the fact thAt the sample Is somewhat less vatlable 
than a representative national sample (l.e., the standard deviations 
are less than 16). To estimate the coefficients for a normative 
sample, the coefficients were corrected for terminal variability 
(a •" 16) at grades 7, 9, and 11 (as done by Bloom, 1964, p. 56). 

Table 5 gives these corrected stability coefficients for the Verbal 
IQ scores. Notice that the values are .03 to .09 higher than corres- 
ponding values In Table 4. Note also that the sequence of coefficients 
over time becomes somewhat more orderly when the coefficients are 
viewed In relation to a fixed degree of terminal variability. 

The grade 11 column depicts the most relevant data since these 
correlations Indicate the relationship with a more-or-less final 
adult level of Intellectual performance. Note that the grade four 
Verbal IQ scores correlated very hl^ly with all tests thereafter. 

Table 4 

Stability and Generallzablllty Coefficients of Verbal IQ Scores 
for the Total Sample. Corresponding N's are given below diagonal. 



Verbal 


1 


1 


4 


7 _ 


9 


11 


Grade 














1 




.51 


.52 


.50 


.44 


.50* 


2 


334 




.46 


.53 


.51 


.54 


4 


263 


705 




.79 


.75 


.73 


7 


260 


620 


1194 




00 

• 


.80 


9 


253 


586 


1071 


1813 




.85 


11 


236 


517 


927 


1527 


1709 





^alue with largest standard error; .95 confidence Interval Is .40 - .59. 
^Value with smallest standard error; .95 confidence lnte7:val Is .847 - .873. 



Table 5 



St^ility and Generalizability Coefficients of Verbal 
Corrected® for Terminal Variability 



Grade Level 


7 


9 


11 


1 


.59 


.53 


.55 


2 


.61 


.60 


.62 


4 


.84 


.82 


.79 


7 




.90 


.84 


9 






.88 



^Corrected to a representative terminal variability (o = 16) , for 
grades 7, 9, and 11, Correction cannot be made on earlier tests 
since ratio IQ was involved and no accurate estimates of the population 
standard deviations available for the various grade levels. 



The degree of stable variances in IQ scores is affected by both 
changes in individuals' true scores and errors of measurement. In 
order to remove the influence of measurement errors on the stability 
coefficients, hence, estimate the relationship between true Verbal 
IQ scores over various time intervals, the stability coefficients 
in Table 4 were corrected for attenuation ( errors of measurement). 
However, since reliability estimates were not available in the 
technical manuals for certain of the tests, it was assumed that the 
geometric mean of tlie reliability coefficients associated with each 
coefficient was ,90. This value was selected as it would provide 
realistic, yet somewhat conservative dissattenuated coefficients 
since .90 was above the median reliability coefficient reported in the 
tests' technical manuals. These disattenuated correlation coefficients 
appear in Table 6. 

It is evident from Table 6 that less than half the variance in 
true Verbal IQ scores at grades 1 and 2 on group intelligence tests 
is lasting and general. It is Important that any Intei^iretatlon of 
IQ scores at grade 1 and 2 be very tentative and reflect the fact 
that substantial changes in true IQ will occur in many cases. By 
grade four however, the individual differences in true IQ scores have 
stabilized greatly and have about 80 percent common variance with 
subsequent IQ scores thereafter. 

Stability of Non-Verbal IQs 

The stability coefficients for Non-verbal IQs for the total sample 
are given in Table 7. (Tlie corresponding N*s on which the r's are 
baaed were given In Table 4.) Notice that the coefficients tend to 
be less than corresponding values for Verbal IQs shown In Table 4. 



Table 6 




' .V 



- 





, i 




s; 



Stability and Gene rail zab 111 ty Coefficients for Verbal IQ Scores 
Corrected for Terminal Varl^lllty and Attenuation* 



Grade Level 


7 


9 


11 


1 


.65 


.59 


.61 


2 


.68 


.67 


.69 


4 


.93 


.91 


.88 


7 




1.00 


.94 


9 






.98 



^Assualiig geonetrlc naan of reliability coafficlenta la .90. 



Table 7 

Stability Coefficient of Non-Verbal IQ Scores for Total Sample 



Grade 


2 


4 


7 


9 


11 


1 


.36 


.29 


.36 


.26 


.29® 


2 




.47 


.49 


.45 


.47 


4 






.61 


.59 


.56 


7 








.78^ 


.73 


9 










.76 



^alue with the largest standard error; ,95 confidence Interval la 
.17 - .40 

b 

Value with the smallest standard error; .95 confidence Interval la 
.761 - .797 
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Olkins (1967, p. 113-115) procedure to test for significance of the 
difference between the non-independent stability coefficients was 
used to test for significance of difference between corresponding 
stability coefficients for Verbal and Non-verbal IQs. The z-ratios 
were all significant (p < .05) except for coefficient involving 
grade two. All comparisons involving grade four or above were 
significant beyond the .001 level. The irregularity at grade two 
resulted from the fact that, whereas Non-verbal IQs at grade two 
were considerably more stable than those at grade one, grade two 
Verbal IQs were not wore stable than first-grade scores. 

The data support the general conclusions that observed Non-verbal 
IQ scores show even greater change over time than do Verbal IQ scores 
on group tests. This is particularly surprising in view of the 
fact that a logical study of the items suggests that they should be 
far less Influenced by environmental factors. 

The Non-verbal stability coefficients in Table 7 were corrected for 
their slightly-less-than-typical variability. Estimates of corres- 
ponding representative stability coefficients (a = 16) are given 
in Table 8 (analogous to Table 5 for Verbal IQs). Corresponding 
disattenuated coefficients (analogous to Table 6 for Verbal IQs) 
are shown in Table 9. 

Table 8 indicates that the greater stability coefficients of the 
Verbal IQ scores is not due to differential heterogeneity. Table 
9 suggests that there is less lasting general variance in the . 
Non-verbal IQ scores than in the Verbal IQ scores. Not until grade 7 
was the common variance in true scores greater than fifty percent. 

In other words, even greater caution is required with Non-verbal 
IQs than for Verbal IQs in any long term guidance or other uses. 

Both the obtained IQ scores and true IQ scores show greater change 
over time than for Verbal tests. A study of other previous research 
indicated that this finding is generally consistent across other 
tests (Hopkins, 1963; Finley et al. , 1966; Lorge et al. , 1966; 

Tyler, 1958; Meyer and Bendig, 1961; Eagle, 1966; Bradshaw, 1964; 

Hopkins, 1971). 

The Stability of Total ^ Scores . 

Since Total IQ scores on the CTMM and L-T result from averaging 
the Verbal and Non-verbal IQ scores, they tend to have sll^tly 
less associated measurement error. Consequently, their values, 
as shown in Table 10, tend to be slightly, but not meaningfully, 
higher than that found for Verbal IQs. The stability and generallzabillty 
coefficients for Total IQ scores, and the corresponding values corrected 
for terminal variability are given in Table 11. Table 12 presents 
the associated disattenuated values. 



20 



14 



Table 8 



Stability and Gene rallz ability Coefficients* for Non-verbal IQ Scores 
Corrected for Terminal Variability 



b 



Grade Level 


7 


9 


11 


1 


.40 


.34 


.33 


2 


.55 


.56 


.53 


4 


.66 


.67 


.61 


7 




.82 


.77 


9 






.79 



*See TableA for associated sample sizes, 
^o ■ 16; (see footnote a. Table 5). 



Table 9 

Stability and Gene rail zablllty Coefficients for Non-verbal IQ Scores 
Corrected for Terminal Variability and Attenuation* 



Grade Level 


7 


9 


11 


1 


.44 


.38 


.37 


2 


.61 


.62 

1 


.59 


4 


.74 


.74 


.68 


7 




.91 


.85 


9 






.88 



*A88uadng geoaietrlc mean of reliability coefficients is •90. 
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Table 10 



Stability and Gene rail z ability for Total IQ Scores 



Grade Level 


2 4 


7 


9 


11 


1 


.57 .51 


.50 


.45 


.50 


2 


.63 


.62 


.62 


.63 


4 




.77 


.74 


.73 


7 






.88 


.83 


9 








.86 



Table 11 

Stcd>lllty and Generali zablllty Coefficients for Total IQs Corrected 

to a Conunon Variability* 



Grade Level 


7 


9 


11 


1 


.57 


.55 


.56 


2 


.69 


.72 


.70 


4 


.82 


.81 


.78 


7 




.91 


.86 


9 






.89 



^Corrected to representative variability according to normative data 



Table 12 

Stability and Generallzablllty Coefficients for Total IQ scores Corrected 
for Terminal Variability and Attenuation^ 



Grade Level 


7 


9 


11 


1 


.63 


.61 


.62 


2 


.77 


.80 


.77 


4 


.91 


.90 


a 

00 


7 




1.00 


.96 


9 






.99 



*Aa8ualng geometric mean of xe&lablllty coefficients la *90. 
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Graphic Representation of Stability Coefficients 



Tlie findings pertaining to the stability and change of Verbal, 
Non-verbal, and Total IQs are presented In Figure 1, which graphically 
depicts the correlation of IQs at eacli grade level with terminal 
(Grade 11) IQ scores (corrected to a common terminal variability, 
a > 16). Corresponding data from Bloom (1964, p. 56) on an Indl^dual 
verbal test (Stanford-Binet) are also reproduced In Figure 1 to 
Illustrate the contrast In IQ stability between Individual vs. 
group tests. 

General Conclusions regardin g IQ Stability . 

Several general conclusions are evident from Figure 1 and 
associated Tables. 

1. Verbal IQs are mucli more stable than Non-verbal IQ scores, 
especially prior to Junior high school. 

2. At grades one and two Verbal and Non-verbal IQs reflect 
relatively little lasting variance, even over short Intervals. 

3. By grade four, Verbal IQs show very high relationship (about 
.8) with corresponding IQs thereafter. 

4. Non-verbal IQs at . grade seven reflect about the same degree 
of stability as Verbal IQs at grade four. 

5. Total IQs have a pattern of stability highly similar to that 
of Verbal IQ scores. 

6. IQ scores from group tests to be considerably less stable 
than Stanford-Blnet IQs prior to grade seven, after which there Is 
little differences In stability. 

7. In the main, there Is considerable change In true Verbal 
IQ scores prior to grade four, but little change thereafter. 

8. In tlie main, there Is considerable change In true Non-verbal 
IQ scores In elementary schools, bu|: only minor changes thereafter. 

Changes In IQ Scores . 

Clianges In IQ scores are reported In a different form In Tables 
13-15 where the means and standard deviations for both algebraic 
and absolute differences between IQs are given for each pair of 
grade levels. The average amount of change per student , without 
regard for the direction of the change. Is given by the mean of the 
absolute differences. Since the distribution of absolute differences 
Is positively skewed, the normal curve model should not be applied 
to estimate the proportion of students whose IQ clianged by some 
specified amount,; (Tcliebycheff 's Inequality (Hays, 1963, p. 188) can 
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Figure Ij. Graphic Representation of the Stability Coefficients for Verbal, Non-verbal, and Total IQs at Certain 
Grade Levels xTlth IQ at Grade 11 (corrected to a common variability, a = 16). Corresponding data from Individual 
Test also shown for comparison. 



be used to obtain crude, conservative estiiiuxtes of such proportions. 
For exan 5 )le, the proportion having an IQ change which is greater than 
tMo standard deviations of absolute change from the mean absolute 
change is less than •25.) 

The absolute differences beto/een IQs are influenced by both the 
instability of scores and the difference between test means at the 
two grade levels. To remove the effect of differences between 
test means, which may reflect norming artifacts, the distribution 
of algebraic differences between IQs was obtained. The standard 
deviation of algebraic differences indicates the degree of IQ 
change independent of norming imperfections and/or changes in the 
group's Intellectual elevation relative to the norming population. 

The mean and standard deviation of the distribution of algebraic 
differences allo\^ an estimate of the proportion of students whose 
IQ changed by any specific amount. Previous research (Hopkins and 
Bibelheimer, 1971) has indicated that the distributions of algebraic 
differences in IQ scores are normally distributed; hence the normal 
curve model is applicable. For example. Table 13 reveals that from 
grade one to grade two, about one-sixth of the students had Verbal IQ 
scores that decreased by 15 or more points, and about one-sixth 
had scores that increased by 15 or more points. Tlie mean change, 
irrespective of direction (absolute difference) was about 12 (11,79) 
points. As another example. Table 13 sho^/s that between grades 7 and 
11, about two-thirds of the students have Verbal IQ scores that 
differ by 9 (8,42) points or less with the mean change being about 
7 points (6.75). The Total IQ show less IQ change than either Verbal 
or Non-verbal IQs largely because a is less than 16, (See Table 2, 
especially footnote b.) The mean Total IQ change between grades 9 
and 11 was only 5 points and two— thirds of the sample had IQs 
that changed leas than 7 points during that period (see Table 15) , 

Comparative IQ Stability for Boys and Girls , 

Relatively little attention has been devoted to the comparative 
IQ stability as related to sex. Eagle (1966) reported much less 
four-year stability for boys than girls on both Lorge-Thomdike 
Verbal and Non-verbal Tests, Hopkins and Bibelheimer (1963) 
and Finley, ejt al, (1966) found no significant sex differences in 
IQ stability on the California Tests of Mental Maturity. Although 
other investigations have not explored the question statistically 
a perusal of the findings of Meyer and Bendig (1960, 1961), Doppelt 
and Bennett (1951) and Layton (1958) reveals no consistent pattern 
of differential stability by sex on other cognitive tests. 

Table 16 presents means, and standard deviations for boys and 
girls in the total sample. 

Notice that boys and girls are highly similar, both in means and 
variability. Corresponding stability coefficients for Verbal, Non- 
verbal, and Total IQs for each sex are given in Table 17, 



Table 13 



I 

i 



Means and Standard Deviations Algebraic and Absolute Differences 
Between Verbal IQs for Each Pair of Grade Levels 

1 



Grade 




2 


Algebraic Differences 
4 7 9 


11 


1 


X 

8 


- .04 
15.08 


-4.94 

14.44 


-2.14 

12.39 


-3.07 

12.72 


-2.23 

12,75 


2 


X 

s 




.52 

17.18 


.52 

15.23 


- .72 
15.49 


.66 

14,89 


4 


X 

8 






- .63 
9.96 


-2.09 

10.53 


- ,42 

10,78 


7 


X 

8 








-1.16 

7.28 


.53 

8.62 


9 


X 

8 










1.75 

7.33 


Grade 




2 


Absolute Differences 
4 7 9 


11 


1 


X 

8 


11,79 

9.38 


12.21 

9.14 


9.98 

7.62 


10.33 

8.00 


10.26 

7.87 


2 


X 

8 




13.67 

10.40 


12.03 

9.34 


12.39 

9.32 


11.93 

8.91 


4 


X 

3 






7.83 

6.18 


8.43 

6.64 


8.54 

6.59 


7 


X 

■ 8 .' 








5.76 

4.60 


6.75 

5.38 


9 


X 
' 8 










5.89 

4.70 







Table 14 



I Means and Standard Deviations of Algebraic and Absolute Differences 

I Beti/een Non-Verbal IQs for Each Pair of Grade Levels 



IV 

1 

1 


Grade 




2 


Algebraic Differences 
4 7 9 


11 




1 


X 


-8.42 


-13.64 


-10.04 


-16.73 


-16.73 


1 




8 


17.73 


18.74 


15.60 


14.77 


15.25 


1 


2 


X 




-3.28 


-4.29 


-9.84 


-8.90 


1 




8 




17.30 


16.00 


16.09 


15.91 




4 


X 






-1.47 


-6.65 


-5.81 


f 




8 






14.23 


14.16 


14.27 




7 


X 








-5.24 


-4.07 






8 








9.66 


10.47 




9 


X 










.16 






8 










9.73 










Absolute Differences 






Grade 




2 


4 


7 


9 


11 


1 


1 


X 


15.61 


18.81 


15.01 


18.80 


18.50 


j 




8 


11.88 


13.52 


10.89 


12.01 


13.03 


r*' 


2 


X 




13.81 


13.25 


15.26 


14.74 


K’ 




8 




10.91 


9.93 


11.08 


10.64 


1 


4 


X 






10.98 


12.38 


12.09 


■x 




8 






9.16 


9.56 


9.55 


1 


7 


X 








8.62 


9.02 


> 




8 








6.81 


7.36 


V* 


9 


X 










7.66 


1* 

jV 

j; 




8 










5.99 



Si 

't 

I 









O 

ERIC 



■ t 

1 

'I 

'I- 



27 



21 



Table 15 



Means and Standard Deviations of Algebraic and Absolute Differences 



Between 


Total IQs 


for Each 


Pair of Grade 


Levels 










Algebraic Differences 




Grade 




2 


4 


7 


9 


11 


1 


X 


-4.36 


-9.46 


—6.23 


•10.08 


-9.52 




8 


12.10 


12.71 


11.29 


10.80 


11.13 


2 


X 




-1.38 


-1.84 


-5.25 


-4.08 




8 




12.35 


11.73 


11.56 


11.33 


4 


X 






-1.00 


-4.31 


-3.09 




8 






9.50 


9.85 


9.77 


7 


X 








-3.21 


-2.29 




8 








6.33 


7.28 


9 


X 










.95 




8 










6.50 








Absolute Differences 




Grade 




2 


4 


7 


9 


11 


1 


X 


10.24 


12.60 


10.43 


12.24 


11.54 




s 


7.76 


9.61 


7.57 


8.27 


9.02 


2 


X 




9.60 


9.50 


10.24 


9.64 




s 




7.89 


7.12 


7.51 


7.20 


4 


X 






7.42 


8.54 


8.10 




s 






6.01 


6.53 


6.27 


7 


X 








5.56 


5.84 




s 








4.41 


4.91 


9 


X 










5.15 




s 










4.08 



22 



2 



8 



Table 16 

Means end Standard Deviations for Each Sex at Each Grade Level* 



Grade 




Verbal 






Non-Verbal 




Boys 


Girls 


Boys 


Girls 


X 


8 


X 


8 


X 


8 


X 


8 


1 


114.6 


10.7 


116.6 


11.5 


105.6 


12.1 


107.6 


11.7 


2 


117.8 


16.4 


116.2 


17.2 


113.0 


16.9 


113.4 


16.5 


4 


112.5 


16.1 


114.9 


15.2 


113.7 


17.5 


114.8 


15.3 


7 


112.9 


13.7 


115.7 


13.2 


115.0 


15.1 


115.2 


13.4 


9 


114.8 


13.0 


115.9 


13.0 


120.5 


14.2 


119.8 


13.4 


11 


113.9 


13.5 


113.3 


13.7 


121.3 


14.8 


118.6 


13.5 



^Sample sizes are given In Table 17 
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Only 9 of the 45 con^arlsons in r*s differed by enough to 
reject the null hypothesis at the .10 level (whereas about 5 would 
have been expected to by chance) . In every Instance where the 
difference was significant, greater stability was reflected by the 
girls . 

Using a binomial statistical model which is not definitive due 
to the lack of independence in the data, the girls showed greater 
stability coefficients on 9 df the 15 (60Z) Verbal IQ comparisons, 

13 of 14 (93%) comparisons in Non-verbal comparison (p < .01), and 

14 of 14 comparisons in Total IQ score (p < .01). It seems clear that 
there is a trend for slightly greater stability in the IQ scores of 
girls. The differences is so small, however, that it is of 
negligible practical significance. The differences were much less 
than those reported by Eagle (1966) , even though the same tests 

(L-T) and grade levels (4-7) were employed. 

Rel i ability of Verbal vs . Non-verbal IQ Differences . 

Interpretations of performance on Verbal and Non-verbal 
intelligence tests often consider the difference in IQ scores, as well 
as the specific Verbal and Non-verbal IQ scores. Clinical and 
educational significance is often attached to these differences (see 
Hopkins, 1964). If, these differences are unreliable over time, 
however, no lasting significance can validly be attached to a 
Verbal-Non-verbal IQ difference. Table 18 gives the correlations of 
Verbal IQ - Non-verbal IQ differences across the various time 
Intervals. Notice that the reliability of the difference in 
scores is extremely low for grade two differences, quite low for 
grade four differences, and becomes moderately stable at grade 
seven. Hence no lasting significance should be attached to differences 
in Verbal and Non-verbal IQs below grade seven, unless, of course, the 
difference is very great (e.g., 20 or more points at grade four). 

The difference in Verbal and Non-verbal IQs at junior high school 
does show considerable stability (.50 or higher) and could be of 
diagnostic value in assessing intra-individual strengths and weaknesses. 
One should also bear in mind the fact that if the coefficients in 
Table 18 were dlssattenxjated to indicate the stability of differences 
in true Verbal and true Non-verbal IQs, they would increase substantially. 
The meager lasting stable variance in Non-verbal IQ scores prior to grade 
seven no doubt accounts for the lack of lasting st^le Verbal - Non- 
verbal difference prior to grade seven. 

Associated Changes in Verbal and Non-verbal IQa. 

If maturatlonal, environmental, or situational differences 
account for much of the iidlvldual differences in both Verbal and 
Non-verbal IQ scores, changes in Verbal IQ scores would be expected to 
correlate with changes in Noihverbal IQ scores. The correlation 
coefficients in Table 19 reflect the relationship among the associated 
dianges in Verbal IQ scores with Non-verbal IQ scores. Notice that 
all the correlations are positive and most are statistically significant. 




yet low in magnitude. It can be concluded therefore, that, although 
there is a definite, but slight trend for increases or decreases 
in Verbal IQs to accompany Increases and decreases in Non-verbal 
IQ scores. The magnitude of the relationship is too small to be 
of any real predictive value, yet it is consistent with theoretical 
expectations . 



Table 18 



Stability Coefficients for Verbal - Non-verbal IQ Differences for 
Students with a Complete set of Scores in Grades 2-11. 

(n *» 440) 



Grade Level 


4 


7 


9 


11 


2 


.08 


.23 


.13 


.19 


4 




.32 


.41 


.34 


7 






.53 


.50 


9 








.55 



Table 19 

Correlation Coefficients Between Changes In Verbal IQ Scores 

Across Time with dissociated Changes in Non-Verbal IQ Scores. 

(n ■ 440) 



Grade Level 


4 


7 


9 


11 


2 


CM 

O 

• 


.12** 


.08* 


.08* 


4 




.18** 


.25** 


.22** 


7 






.13** 


.17** 


9 








.18** 



* p < ,05 (directional test) 

** p < .01 (directional test) 



Predictive Value of Multiple Scores . 

To what extent are IQ scores other than the most recent score 
an indicator of eventual Intellectual status. In practice, often 
only the most recent score is used and all prior IQ scores 
Iimored In a few situations, all available IQ scores are averag^, 
wUh the assumption that errors of measurement will ® 

averaging procedure implicitly assumes constancy of true IQ scores 
over time. A third strategy would be to employ a multiple 
r^resslon model which would optWly weight all ™ 11 ab ^ score ^ 
to maximize the prediction of a subsequent score. From a practical 
oerspectlve, the multiple regression approach suffers from * 
of simplicity and would require a convincing demonstration of superiority 
in orZr for it to offset its practicability drawbacks. 

Th*» Multiple Regression Approach. Wi t h grade eleveja ^ 

a. thf ^Sr v;;fer5.r!^ation of the most recent .cor. 

Ta^rade nine. Verbal IQ) rose from .827 to .887 when all the 15 
variables from grades 1, 2. 4, 7, and 9 were weighted optlm^ly. 
Accordingly the standard error of estimate declined 
The R-value Increased .05 with the addition of the second variable 
(grade two, Total IQ) , and an additional .02 with the addition of a 
^Hrd variable (grade four. Verbal IQ). The fourth predictor 
^«de se^. verbal) increased R by only .01. All other subsequent 

increases were negligible (.003 or less). 

Usin g grade eleven Non-verbal ig. as ^ dependent ^rlabU . the 
correlation of the most recent Non-verbal IQ TiradT^npT^t^. , 
rose to .753, .764. and .769 when grade seven Non-veAal, grade four 
Total, and grade two Non-verba were employed sequentially as 
predictors. All other variables Increased R negligibly. 

Using Total ^ the dependent variabLe, f f 

.829 f^hr^t recent Total IQ score Increased to .88 with the 
addition of the best four additional predictors. 

In conclusion, scores in addition to the most recent scores 
produced^lncrea^e in the predictability of grade ele«n IQ score., 
but the Increases were not substantial and appear to not be 
justified for most educational purposes. This pattern may have 
^om the fact that the most recent score (grade 

highly With the criterion, hence making ImproveMnt dlfflc^t^ Had 
th^question been addressed eliminating the grade nine ^he 

Lgree of improvement would no doubt have Increased coMlderably. 
Nevertheless appropriate Intellectual assessment should entail 
retesting every two or three years, hence the 9 «® 8 tlon ^ 

in a realistic manner. In addition, the issue eleven 

Addressed employing a different criterion, yet the grade eleven 
performance's the most definitive, educationally significant criterion. 
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The Use of the Average of Prior Scores as Predictors , 

Optimal wel^tlng strategies has little promise of being used 
generally, because of the Idiosyncratic natare of the results. 

However, an average of previous scores can. easily be Implemented, 

If fotnd to be fruitful means of Identifying eventual Intellectual 
status. 

Table 20 given the correlation coefficients of the average of 
all previous tests of same type (Vetbal, Non-verbal, or Total IQ) with 
various criteria. In parenthesis the correlation of the most 
recent test with the criterion Is given for comparative purposes. 

Table 20 reveals a strong trend toward the most recent score 
to being better for predicting subsequent IQ scores than the average 
of prior scores. The sole exceptions shows the average of XQs from 
grades one and two to be bet ter than grade two IQs alone In both the 
Vezbal and Total IQ areas. In all other Instances the difference 
Is negligible or the average IQ from previous Is poorer than the use 
of only the oust recent score. In some Instance, the difference 
was substantial. For example. Total grade 7 IQs alone correlated 
.88 with Total grade nine IQs. Yet when the average of previous 
Total IQ scores from grades 1, 2,4, and 7 were used, the correlation 
dropped to .76J 

It can be concluded that the averaging of other scores with the 
most recent score Is definitely not recommended (except In the rate 
situation where scores are available at both grades one and two). 

The most recent score Is the best Indicator of future performance. 

If other scores are to be used, a multiple regression equation will 
be needed so that the prediction can be Improved slightly rathei^ 
than attenuated. 

Factorial Composition of Verbal vs, Non-Verbal 11 - 

A factor analysis of Verbal and Non-verbal IQs extending over 
grade levels 1-11 was performed on the class of *69 data In order to 
obtain a better understanding of the concepttial meaning of the 
two types of IQ scores. Within the correlation matrix are two subsets 
of correlations which approached simplex structure, the Inter- 
correlatlons of Verbal IQs and the Intercorrelatlons of Non-verbal 
IQs, Cronbach (1967) has cautioned that a factor analysis of a 
correlation matrix with simplex structure should be viewed as only 
heuristic strategy, not as a method for discovering inderlylng 
developmental processes. 

A principal-axis solution was performed with squared multiple 
correlations as communallty estimates , The Harrls-Xalser method of 
oblique factor transformation was selected as it appears to have 
greatest promise for approximating slnq>le structure with factAvlally 
complex data (Hakstlan, 1971)* The Independent clusters solution 
was attempted Initially, but failed to give results as good as those 
derived from the A*A proportional to L primary factor solution (In which 



Table 20 



Correlation of the Average of Previous Teats Coapared to the 
Correlation of Only the Most Recent Test In the Set 

(In parenthesis).* 



Verbal IQ 



Grade 


Levels Averaged 




Criterion Grade 
























A 




1 




9 




1 


+ 


2 














.61 (.A6) 


.60 


(.53) 


.53 


(.51) 


.61 


(.54) 


1 


+ 


2 


+ 


A 












.73 


(.79) 


.68 


(.75) 


.7A 


(.73) 






2 


+ 


A 












.7A 


(.79) 


.70 


(.75) 


.72 


(.73) 


1 


+ 


2 


+ 


A 


+ 


7 












.77 


(.86) 


.80 


(.80) 






2 


+ 


A 


+ 


7 












.80 


(.86) 


.79 


(.80) 


1 


+ 


2 


+ 


A 


+ 


7 


+ 


9 












.84 


(.85) 






2 


+ 


A 


+ 


7 


+ 


9 












.84 


(.85) 




















Non-Verbal IQ 












1 


+ 


2 














.A1 (.A7) 


.51 


(.A9) 


.39 


(.A5) 


.A8 


(.47) 


1 


+ 


2 


+ 


A 












.59 


(.61) 


.49 


(.A9) 


.56 


(.56) 






2 


+ 


A 












.60 


(.61) 


.58 


(.59) 


.51 


(.56) 


1 


+ 


2 


+ 


A 


+ 


7 












.61 


(.78) 


.67 


(.73) 






2 


+ 


A 


+ 


7 












.69 


(.78) 


.69 


(.73) 


1 


+ 


2 


+ 


A 


+ 


7 


+ 


9 










• 


.71 


(.76) 






2 


+ 


A 


+ 


7 


+ 


9 












.7A 


(.76) 




















Total IQ 












1 


+ 


2 














.65 (.63) 


.65 


(.62) 


.58 


(.62) 


.66 


(.63) 


1 


+ 


2 


+ 


A 












.75 


(.77) 


.67 


(.7A) 


.7A 


(.73) 






2 


+ 


A 












.75 


(.77) 


.72 


(.7A) 


.73 


(.73) 


1 


+ 


2 


+ 


A 


+ 


7 












.76 


(.88) 


.81 


(.83) 






2 


+ 


A 


+ 


7 












.81 


(.88) 


.81 


(.83) 


1 


+ 


2 


+ 


A 


+ 


7 


+ 


9 












.85 


(.86) 






2 


+ 


A 


+ 


7 


+ 


9 












.8A 


(.86) 





A refers to the primary factors). The Initial A*A proportional 
to L primary factor-pattern solution was converted to the corres- 
ponding reference structure solution to facilitate the Inter- 
pretablllty of the factors. (The solution reported In Table 21 
gives the correlations of the various Verbal-Non-verbal IQ 
scores with the references axis.) 

Factor I In Table 21 Is Identified as a verbal factor which 
becomes strongly evident beginning In grade four. Factor II 
Is a Non-verbal factor which Is substantially defined by the Non- 
verbal IQs at grade seven and subsequently. Factor III appears to 
be a temporary grade one, and to a lesser extent grade two, effect. 
It may reflect transitory maturatlonal or test-taking abilities. 
Factor IV appears to reflect a transitory non-verbal factor that la 
evident In grade two, and four, and to a slight extent. In grade 
seven. 

The results In W)le 21 suggests that separate Verbal and Non- 
verbal IQs should not be reported prior to grade four, since the 
related IQ scores represent different factors than do later IQ 
scores bearing the same label. (A separate factor analysis using 
orthogonal (varlmax) protrayed an Identical pattern, yet without 
the degree of simple structure depicted In Table 21.) 

CONCLUSIONS AND RECOMMENDATIONS 



Conclusions • 



The study provides support for the following generallratlona t 

1. Contrary to most textbook treatments of the topic and 
prevailing professional Impression, there Is little or no Information 
on the stability of IQ scores from group tests especially (1) at the 
primary grades, or (2) for time Intervals greater than three 

years, or (3) on currently used tests, or (4) on Non-verbal tests. 

2. Verbal IQs are much more stable than Non-verbal IQ scores, 
especially prior to junior high school. 

3. At grades one and two Verbal and Non-verbal IQs reflect 
relatively little lasting variance, even over short Intervals. 

4. By grade four. Verbal IQs show a very hl^ relationship 
(about .8) with corresponding IQs thereafter. 

5. Non-verbal IQs at grade seven reflect about the sane degree 
of stability as Verbal IQs at grade four. 

6. Total IQs have a pattern of stability highly similar to that 
of Verbal IQ scores. 

7. IQ scores from grotq> tests to be considerably less stsble than 
Stanford-Blnet IQs prior to grade seven, after which there Is little 
differences In stability. 
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Ttble 21 

Correlations of Verbal and Non-verbal IQ Scores with Reference Axes. 



Saaple I 


Reference 
Axis I* 


Reference 
Axis II* 


Reference 
Axis III* 


Reference 
Axis IV* 


Gr. 1 Lang - CTIM 


.15 


.06 


.42 


.05 


Gr. 1 N-Lang- CTHM 


-.02 


.03 


M 


.09 


Gr. 2 Lang - CTHH 


.21 


.07 


.27 


.07 


Gr. 2 N-Lang- CTMM 


.06 


.13 


.16 




Gr. 4 Lang - CUM 




-.01 


.11 


.17 


Gr. 4 N-Lang- CTMM 


.08 


.19 


.06 


.35 


Gr. 7 Verbal - L-T 


.63 


.12 


.02 


.08 


Gr. 7 N-Verbal-L-T 


.IT 


.43 


.05 


.19 


Gr. 9 Verbal - L-T 


•u 




.00 


-.01 


Gr. 9 N-Verbal-L-T 


.17 


•iS. 


.03 


.05 


Gr.ll Verbal - L-T 


.55 


.17 


.11 


.05 


Gr.ll N-Verbal-L-T 


.15* 


.48 


.06 


.10 



*Re fleeted 
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8. In the main, there is considerable change In true Verbal IQ 
scores prior to grade four, but little change thereafter. 

9. In the main, there Is considerable change In true Non- 
verbal IQ scores In elenmtary schools, but only adnor changes 
thereafter. 

10. Differences in Verbal and Non-verbal IQs show neager 
stability over time prior to grade seven, after which there is 
considerable stability. 

11. The use of IQ scores in addition to the most recent score 
offers little promise for improving the determination of eventual 
intellectual status. Multiple regression can contribute sli^t 
improvement In prediction, wherever the averaging of prior performances 
lessened the accuracy in prediction. 

12. The factor analytic study confirmed informal observations 
made from a study of the general correlation matrix — Verbal IQ 
variance become stable at about grade four. Non-verbal IQ variance 
stabilizes at about grade seven. 



1. Group Intelligence tests not be routinely adalnistered in 
grades one and two unless the users and interpreters of thtee 

scores have received extensive training in the meaning and limitations 
of these scores. The long-term meaning of these scores is far 
below that of Individually-administered verbal intelligence tests, 
yet the typical teacher lacks the training necessary to properly 
differentially Interpret — hence an "IQ score is an IQ score." 

A reading readiness test is more useful for identifying prospective 
success and failure in grade one (Hopkins and Sitkel, 1969), and 
the consequences of misinterpretation Is far less critical. 

2. Non-language and Non-verbal IQ scores must be Interpreted 
even more cautiously than Verbal IQ scores. 

I 

I 3. Teachers and other users of IQ scores from group tests need 

to be made aware of the fact that even when there is considerable overall 
I stability, the scores of a few individuals will vary greatly. 

[ 4. No meaning should be attached to differences In Verbal 

I and Non'-verbal IQ scores prior to grade seven. 

i 

\ 

r 
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